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Abstract —Cyclic redundancy check (CRC) codes check if a 
codeword is correctly received. This paper presents an algorithm 
to design CRC codes that are optimized for the code-specific 
error behavior of a specified feedforward convolutional code. 
The algorithm utilizes two distinct approaches to computing 
undetected error probability of a CRC code used with a specific 
convoiutional code. The first approach enumerates the error 
patterns of the convoiutionai code and tests if each of them is 
detectabie. The second approach reduces complexity significantly 
by exploiting the equivalence of the undetected error probability 
to the frame error rate of an equivalent catastrophic convolu¬ 
tional code. The error events of the equivalent convolutional code 
are exactly the undetectable errors for the original concatenation 
of CRC and convolutional codes. This simplifies the computation 
because error patterns do not need to be individuaily checked 
for detectabiiity. As an example, we optimize CRC codes for 
a commonly used 64-state convolutional code for information 
length k=1024 demonstrating significant reduction in undetected 
error probability compared to the existing CRC codes with the 
same degrees. For a fixed target undetected error probability, 
the optimized CRC codes typically require 2 fewer bits. 

Index Terms —catastrophic code, convolutional code, cyclic 
redundancy check (CRC) code, undetected error probability. 

I. Introduction 

E RROR-detecting codes and error-correcting codes work 
together to guarantee a reliable link. The inner error- 
correcting code tries to correct any errors caused by the 
channel. If the outer error-detecting code detects any residual 
errors, then the receiver will declare a failed transmission. 

Undetected errors result when an erroneously decoded code¬ 
word of the inner code has a message that is a valid codeword 
of the outer code. This paper designs cyclic redundancy check 
(CRC) codes for a given feedforward convolutional code such 
that the undetected error probability is minimized. 

A. Background and Previous Work 

A necessary condition for a good joint design of error¬ 
detecting and error-correcting codes, both using linear block 
codes, is provided in 0 - However, this condition is based on 
the minimum distances of the inner and outer codes and does 
not consider the detailed code structure. 

Most prior work on CRC design ignores the inner code 
structure by assuming that the CRC code is essentially oper¬ 
ating on a binary symmetric channel (BSC). We refer to this 
as the BSC assumption. The BSC assumption does not take 
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advantage of the fact that the CRC code will only encounter 
error sequences that are valid codewords of the inner code. 

The undetected error probability of a CRC code under the 
BSC assumption was evaluated using the weight enumerator 
of its dual code in |2j. Fast algorithms to calculate dominant 
weight spectrum and undetected error probability of CRC 
codes under the BSC assumption were presented in 0, 0. 

In 0, Koopman and Chakravarty list all standard and good 
(under the BSC assumption) CRC codes with up to 16 parity 
bits for information lengths up to 2048 bits. The authors 
recommend CRC codes given the specific target redundancy 
length and information length. 

For more than 16 CRC parity bits, it is difficult to search 
all possibilities and find the best CRC codes even under the 
BSC assumption. Some classes of CRC codes with 24 and 32 
bits were investigated under the BSC assumption in |6j and 
an exhaustive search for 32-bit CRC codes under the BSC 
assumption was performed later in 0 - 

Because all of these designs ignore the inner code by 
assuming a BSC, there is no guarantee of optimality when 
these CRC codes are used with a specific inner code. 

A few papers do consider CRC and convolutional codes 
together. In the CRC code was jointly decoded with the 
convolutional code and used to detect the message length 
without much degradation of its error detection capability. In 
Q, CRC bits were punctured to reach higher code rates. The 
authors noticed that bursty bit errors caused an impact on the 
performance of the punctured CRC code. 

Recently, fT0| and 0 have considered a CRC code used 
for error correction jointly with convolutional and turbo codes, 
respectively. However, in these cases, the error detection 
capability of the CRC code is degraded. Such undetected error 
probability as well as false alarm probability were analyzed 
in m under the BSC assumption. The authors of 0 also 
modeled the bursty error at the turbo decoder output using a 
Gilbert-Elliott channel. 

B. Mam Contributions 

We propose two methods to compute the undetected error 
probability of a CRC code concatenated with a feedforward 
convolutional code. The exclusion method enumerates possible 
error patterns of the inner code and excludes them one by one 
if they are detectable. The construction method constructs a 
new convolutional code whose error events correspond exactly 
to the undetectable error events of the original concatenation 
of CRC and convolutional codes. With these two methods 
as tools, we design CRC codes for the most common 64- 
state convolutional code for information length k = 1024 
and compare with existing CRC codes, demonstrating the 
performance benefits of utilizing the inner code structure. 
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Fig. 1. Block diagram of a system employing CRC and convolutional codes. 

This paper is organized as follows: Section [II] provides the 
system model. Section |lH|presents the exclusion and construc¬ 
tion methods for computing the undetected error probability 
of a CRC code concatenated with an inner convolutional 
code. Section Uv] describes how these two methods can be 
used to design a CRC code to minimize the undetected error 
probability for a specific feedforward convolutional code and 
information length. Section [Y] applies this design approach to 
the most common 64-state convolutional code for information 
length k = 1024. Section [Vi] concludes the paper. 

II. System Model 

Fig-0 shows the block diagram of our system model em¬ 
ploying a CRC code concatenated with a convolutional code in 
an additive white Gaussian noise (AWGN) channel. The fc-bit 
information sequence is expressed as a binary polynomial /( x) 
of degree smaller than or equal to k — 1. The m parity bits are 
the remainder r(x) of x m f{x) divided by the degree-m CRC 
generator polynomial p(x). Thus the n = k + m -bit sequence 
described by x m f(x)+r(x) is divisible by p(x) producing the 
fc-bit quotient q{x) and a remainder of zero. The CRC-encoded 
sequence can also be expressed as q(x)p(x). Thus q[x) has a 
one-to-one relationship with f{x). Note that q(x)p(x) is the 
result of processing the sequence x m q[x) (q(x) and m trailing 
zeros) by the CRC encoder circuit described by p(x). 

The transmitter uses a feedforward, terminated, rate- -F con¬ 
volutional code having v memories with generator polynomial 
c{x) = [ci(x), c 2 {x), ■ ■ ■ ,Cjv(a:)]. The output q(x)p(x)c(x) 
of the convolutional encoder is sent to the AWGN channel 
using quadrature phase-shift keying (QPSK) modulation. 

Because the CRC bits are added at the end of the sequence, 
the highest degree term of /( x) is the bit that is first in time 
and first to enter the convolutional encoder. Consistent with 
this convention and in contrast to common representations, the 
highest degree terms of c( x) represent the most recent encoder 
input bits. Thus a convolutional encoder with the generator 
G(D ) = [1 + D 3 + D 4 ,l + D + D 2 ] will have c(x) = [x 4 + 
x + 1, x 4 + x 3 + x 2 ], 

The demodulated symbols are fed into a soft Viterbi de¬ 
coder. The CRC decoder checks the n-bit sequence resulting 
from Viterbi decoding. An undetected error occurs when the 
receiver declares error-free decoding when the Viterbi decoder 
identified an incorrect codeword. 

This work can be applied to feedback convolutional codes as 
well. Consider a feedback convolutional code with generator 
polynomial c(x)/ cfb(x), where cfb(x) is its feedback con¬ 
nection polynomial. This feedback code has the same set of 
codewords as a feedforward code with generator polynomial 
c(x). Assume the Viterbi decoder selects a wrong message 
t(x) at the feedforward convolutional decoder output. In most 


cases, the same received signal is decoded as t(x)cpB(x) by 
the feedback convolutional decoder. However, it is possible 
that the message decoded by the feedback decoder is not a 
multiple of cfb(o:). Such trellis deviation must happen during 
the termination of the codeword and corresponds to either a 
long error sequence with large codeword Hamming distance 
or a short error sequence occurring only at the end of the 
codeword. Either of these cases should not dominate the 
overall codeword error performance and thus we only consider 
message errors of the form t(x)cpB(x). 

Let the greatest common divisor of p{x) and cfb(®) be 
c gc d(a;). The polynomial p(x) divides t(x)c^{x) if and only 
if p(x)/c sc d(x) divides t{x). Hence, the undetected error 
probability of this CRC code concatenated with the feedback 
convolutional code is approximated by that obtained by the 
CRC code p(x)/c gc d(x) concatenated with the feedforward 
convolutional code c{x) and can be analyzed using the meth¬ 
ods presented in this paper. Furthermore, a smart choice when 
using a CRC code concatenated with a feedback convolutional 
code is to pick p{x) and cfb(x) relatively prime. 

III. Undetected Error Probability Analysis 

Let e(x) be the polynomial of error bits in the Viterbi- 
decoded message so that the decoded n-bit sequence followed 
by v zeros (for termination) is expressed as q(x)p{x)x"+e(x). 
If e(x) ^ 0 is divisible by p{x), then this error is undetectable 
by the CRC decoder. This section presents two methods, the 
exclusion method and the construction method, to calculate 
the probability that a non-zero error e[x) occurs that is 
undetectable. 

A. Exclusion Method 

The exclusion method enumerates the possible error patterns 
of the convolutional code and excludes the patterns detectable 
by the CRC code. The probability of the unexcluded error pat¬ 
terns is the undetected error probability. The exclusion method 
filters out part of the distance spectrum of the convolutional 
code through a divisibility test to create the distance spectrum 
of the undetectable errors of the concatenated code. 

1) Undetectable Single Error: An error event occurs when 
the decoded trellis path leaves the encoded trellis path once 
and rejoins it once. Let ed,i(x) be the polynomial of message 
error bits associated with the i th error event that leads to a 
codeword distance d from the transmitted codeword, where 
the range of i is later specified in (0). Note that in this paper, 
the term “distance” always refers to the convolutional code 
output Hamming distance. Let this error event have length ld,i- 
Both ed,i{x) and ld } i are obtained through computer search of 
the given convolutional code. The first (highest power) term 
of ed,i{x) is x ld ’ i ~ 1 and the last (lowest power) term is x v 
because every error event starts with a one and ends with a 
one followed by v consecutive zeros. 

If the received data frame contains only one error event, 
then the polynomial of message error bits can be expressed 
as e(x) = x 9 ed,i{x), where g € [0, n + v — ld,i] indicates the 
possible locations where this error event may appear. If ed,i(x) 
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is divisible by p(x), this error event, including all of its offsets 
g , will be undetectable. 

The union bound of such error probability is given by 

oo ad 

?UD,1 < EE I(p(s) I e dti (x)) 

d—d[ Tee i—1 

max{0,n + v — + 1} P(d), (1) 


where df ree is the free distance of the convolutional code, a d is 
the number of error events with output distance d, the indicator 
function I(-) returns one when ed,i(x) is divisible by p(x) and 
zero otherwise, and P(d) is the pairwise error probability of 
an error event with distance d. The max operator ensures that 
the number of possible locations is always nonnegative even 
when ld,i is large. Note that while (JTJ) does not explicitly use 
the generator polynomial of the convolutional code, it does 
implicitly depend on the generator polynomial because the 
generator polynomial determines the valid trellis error events 
6d,i(x). The subscript “ 1 ” in Pud,i means that this probability 
only includes undetectable errors that are single error events. 
We call this type of error an undetectable single error. 

For a QPSK system operated in an AWGN channel, P(d) 
can be computed using the tail probability function of standard 
normal distribution, i.e. Gaussian Q-function, as HI 

p(d) = Q(v / 2dr) < q(\/24^) e-^ d ~ d ^\ ( 2 ) 


where 7 = E s /Nq is the signal-to-noise ratio (SNR) of a 
QPSK symbol, and E s and Nq /2 denote the received symbol 
energy and one-dimensional noise variance, respectively. Note 
that the accuracy of 0 comes in part from a knowledge of 
{(free ■ A useful Q-function approximation when knowledge of 
df r ee is not available is presented in G3- To generalize ^ 
to a higher-order quadrature amplitude modulation (QAM) 
system with bit-interleaved coded modulation using a random 
interleaver, one can multiply 7 with a modulation-dependent 
factor to obtain an approximation. Details can be found in 1141 
as well. 

To compute Pud,i each error event e,i.i must first be 
identified as either divisible by p(x) or not. One approach 
is to truncate 0 at d to get an approximation, in which 
case all error events with distance d < d can be stored 
and this set of error events can be tested for divisibility by 
the CRC polynomial p(x). The choice of d is based on the 
computational and storage capacity available to implement an 
efficient search such as HD- The required memory size to 
store the error events is proportional to Y2 d =d r , ST=i 

The approximation of (|T|i can be quite tight if the probability 
of the terms with d > d is negligible. However, assuming that 
all error events with d > d are undetectable provides 


00 d ad 

Pud,i < E na dP(d)+ y EW 

d—d-\-l d=dfree i =1 

• I (p(x) | e d ,i{x)) max {0, n + v - l dji + 1}}, (3) 

where l d t for d > d is replaced with v + 1 because the shortest 
error event has length v + 1. Note that ([3]) can be computed 
because the error pattern ed,i{x) is only required for d < 
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Fig. 2. An illustration of two error events. 



d, and the distance spectrum a d for d > d provided by the 
transfer function m can be used for the first term of Q as 
in the frame error rate (FER) bounds of ED- 

In addition to the undetectable single error events discussed 
above, an undetectable error could consist of two or more error 
events, even though each of the error events itself is detectable. 
We will first discuss the case with two error events, and then 
generalize it to multiple error events. 

2 ) Undetectable Double Error: A double error involves two 
error events e dlt i 1 (x) and ed 2 ,i 2 {x) with respective lengths 
ld 1 ,i 1 and l d2 ,i 2 ■ To simplify notation, for u £ {1,2} let e u (x) 
and l u refer to e du ,i u {x) and l du ,i M , respectively. In a data 
frame with two error events, the polynomial of error bits in 
the message can be expressed as e(x) = x 9l+92+l2 ei(x) + 
x 92 e 2 {x), where the exponents of two x’s tell the locations 
of the two error events. Furthermore, <71 > 0 represents 
the interval of symbols (gap) between two error events and 
satisfies g 1 + g 2 + h + h < n + v. If x 9l+l2 ei(x) + e 2 (x) is 
divisible by p(x), the error is an undetectable double error. 
Its length is l\ +1 2 + 9\ and its offset is g 2 . 

An upper bound of the probability of an undetectable double 
error occurring in the codeword is given by 

OO OO a di a d 2 n-\~V — l\—l 2 

pud,2< Y E EE E p(d 1 +d 2 ) 

dl=dfree f^2~^free ^1—1 ^2 — 1 Ql —0 

■l(p(x) | x 9l+l2 ei(x) + e 2 (x)) 

■ (n + v - h - l 2 - gi + 1) . (4) 


The distance of the double error event is simply the sum of the 
individual distances because the error events are completely 
separated as shown in Fig. [2] Two error events that overlap 
are simply a longer single error event, which was treated in 
Section IIII-All 

Computation of ([4]) exactly is problematic because it re¬ 
quires infinite search depth. Replacing the l\ and l 2 of large- 
distance terms in (|4} with v+1 yields a more computation- 
friendly upper bound similar to 0 as follows: 

a di a d 2 n-\-V — l\—l 2 

Pud ,2 < Y E E E P(di+d2) 

(d 1 ,d 2 )eD i 2 i 1=1 12=1 Sl =0 

• I (p{x) | x 9l+h e 1 (x) + e 2 (x)) 

■ (n + v — li — l 2 — gi + 1 ) 

+ E \ ( n “ ^ ( n ~ v -!) 

(di ,d 2 )^Dj 2 

■ a dl a d2 P(d 1 + d 2 ), (5) 


where Dj 2 = |(di,d 2 ) 


di,d 2 > dfree.rfi +d 2 < d 


}■ 
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Because computational complexity limits the single error 
event distance we can search, it is feasible to replace d in Dj 2 
with d + df ree - This is not particularly helpful because we have 
already assumed errors with distance d > d are negligible 
or undetectable during the calculation of undetectable single 
errors. We can also replace all i\ and 1 2 in 0 with v + 1 
and provide another upper bound which does not require any 
length information. 

As described in Appendix [A] the number of <j \ values at 
which to check the divisibility of x 9l+l2 ei(x) + e 2 {x) by p(x) 
can be significantly reduced from n + v — l\ — I 2 + 1. 

3) Total Undetected Error Probability: In general, s error 
events could possibly form an undetectable s-tuple error, 
whether each of them is detectable or not. The error bits in 
the message can be expressed as 

x g,+i 

U—1 \u=u+l 


e ( x ) = E 



This combined error will be undetectable if e(x) is divisible by 
p(x). Therefore, the probability of undetectable s-tuple errors 
Pud s can be approximated or bounded in the same way as 
0- For simpler notation, define sets 

S 

du ^ ^free Vli, ^ ^ d u E d 

u= 1 

Is — j is) |iu ^ [1? ^ [I? } 

s s+1 'I 

g u > 0 Vw, E#« < n + v ~Ew- 

u= 1 u= 1 ) 

Note that G s is determined by all d u and i u , and I s is 
determined by all d u . 

Since an undetectable error may consist of any number of 
error events, the probability of having an undetectable error 
in the codeword Pud is upper bounded by Pud,s- Using 

the computation-friendly bound of each term such as (0 and 
(0, we obtain 

OO OO ( / S \ 

pud<£ p >^+E e p Em 

s=l s—2 l (di,,d s )eDj s \u=l / 

• E E l ( p W 1 e(x )) 

(flri,s,-i)6G,_i 

n + v — E lu — E 9u + 1J1 + E 

u —1 u —1 / ) diGDj 1 

■ E Kp( x ) I e i( x )) max{0, n + u — h + 1}}, (7) 

iiEli 


Gs = • ,5s) 


D 


d,s 


— s ( di , ) ds) 


where the composition of e(x) depends on the number of error 
events s as in (0 and Pj g is the sum of probability of all 
s-tuple errors whose distances are greater than d. 

The probability sum of all large-distance s-tuple errors is 



where the combinatorial number represents the number of 
ways to have s length-^ + 1) error events in a lengthen + v) 
sequence. Using 0 P>Ji can be upper bounded by 

P >d -1 < Q(v / 2 rffree 7 ) e df “ 7 i P E ria dl e~ d ^ L ( 9 ) 

[ d i eD d,i J 

where P is defined as P = nT(£), L)| D=e _ 7 L=1 using the 
transfer function m 

00 a d 

T {D,L)= E Y, Dd L ld ' i - (10) 

d=dfree «=1 


Therefore, the sum of P >( j s terms can be upper bounded by 


E p >,: s <E E 


s=l (dp,••• ,d s )^Dj t 


S\ 


IK P E^ 


< Q(x/2d lrC c7) e d ’” 7 E^ 


e n^ e 

(di,— ,ds)$ D d s \u=l 

OO 

Q (y/2d free y S ) e dfr “ 7 E 


—d u 7 




(lla) 


-=T E 

(di,- ,i,)eDj iS \u=l 

Q( e d,ml < e p - 1 


—d u 7 


(Hb) 


-E^r E n 

S=1 (di,*** ,d s )GDj s \u=l 


^d u & 


—du'y 


►■(lie) 


The bound of Gaussian Q-function <0 is used in ( | 11 a| ), and 
the transfer function is used to evaluate the sum of all s-tuple 
errors in ( | 1 1 h[ >. Using 0 and (|1 lc| >, a bound of Pud can be 
calculated. In fact, when an undetectable error occurs in a 
codeword, the receiver may still detect an error if a detectable 
error happens somewhere else in the codeword. Therefore, 
Pud, which is the probability of having an undetectable error 
in the codeword, is an upper bound of the undetected error 
probability. When channel error rate is low, having a detected 
error occur along with the undetected error is a rare event so 
that our bound will be tight. 

Due to the limitation of searchable depth d of error events, 
the exclusion method is only useful when undetectable errors 
with distances d < d dominate. However, this requirement 
could be violated by a powerful CRC code that detects 
the short-distance errors. The next subsection introduces the 
construction method, which allows the search depth to increase 
to distance d > d. 


B. Construction Method 

The construction method utilizes the fact that all unde¬ 
tectable errors at the CRC decoder input are multiples of the 
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Fig. 3. An example of CRC code, original convolutional code, and equivalent 
convolutional code. 

CRC generator p(x). This method constructs an equivalent 
convolutional encoder c eq (x) = p(x)c(x) to isolate these 
errors. The set of non-zero codewords of c eq (x) is exactly the 
set of erroneous codewords (given an all-zeros transmission) 
that lead to undetectable errors for the concatenation of the 
CRC generator polynomial and the original convolutional 
encoder. Thus the probability of undetectable error is exactly 
the FER of c eq (x). 

Fig. [3] shows an example of how c Pq (x) is constructed from 
c(x) and p(x), where q'(x) with m + v trailing zeros is the 
input that generates the undetectable erroneous codeword. The 
input/output behavior of c eq (x) is exactly the same as the 
concatenation of p(x ) and c(x). For p(x) with m memory 
elements and c(x) with v memory elements, c eq (x) has m+v 
memory elements. At a given time, the state of the original 
encoder c(x) can be inferred from the state of c eq (x) because 
the state of c eq (x) contains the last m + v inputs to p(x) 
which are sufficient to compute the last v outputs of p{x), 
which exactly comprise the state of c(x). 

1) States of the Equivalent Encoder: Define the all-zero 
state S z to be the state where all memory elements of the 
equivalent encoder are zero. When the equivalent encoder is in 
S z , then the original encoder is also in its zero state. Avoiding 
trivial cases by assuming that the x m ~ 1 and a; 0 coefficients 
of p{x) are 1 , there are 2 m — 1 equivalent encoder states 
in addition to S z that correspond to the all-zero state of the 
original encoder. To see this, consider the top diagram in Fig. [3] 
in which the equivalent encoder state is shown as the state of 
the CRC encoder extended with v additional memories. Note 
that whatever state the equivalent encoder is in, there is a 
sequence of v bits that will produce v zeros at the output of 
the CRC encoder that will drive the original encoder state to 
zero. The specific set of v bits is a function of the m-bits of 
state in the CRC encoder. Thus for any m-bits pattern there 
is a corresponding (m + v)-b\t state of the equivalent encoder 
that corresponds to the original encoder being in the zero state. 

We call the 2 m — 1 non-zero equivalent encoder states 
for which the original encoder state is zero detectable-zero 
states, forming a set S D , because they correspond to the 
termination of a detectable error event in the original encoder. 
The remaining 2 m+l ' — 2 m states of the equivalent code 
are called non-zero states, forming a set S N , because the 
corresponding states of the original code are not zero. To 
terminate an error event in the original encoder, the trellis of 
the equivalent code transitions from a state in S N to S z or to a 


TABLE I 

An Example of State Types with p(a-) = x 2 + x + 1 and v = 2. 


Time 

State 

Type 

Equivalent Code 

Original Code 

State 

Input 

State 

Input 

0 

S z 

0000 

1 

00 

1 

1 

S N 

0001 

1 

01 

0 

2 

S N 

0011 

0 

10 

0 

3 

S D 

0110 

1 

00 

0 

4 

S D 

1101 

0 

00 

1 

5 

S N 

1010 

0 

01 

1 

6 

S N 

0100 

0 

11 

0 

7 

S N 

1000 

0 

10 

0 

8 

s z 

0000 


00 



state in S D . If the transition is to S D the cumulative errors are 
detectable because the portion of q’{x)p{x) till this moment is 
not divisible by p{x). If the transition is to S D , the cumulative 
errors are detectable because the portion of q'(x)p(x) till this 
moment is divisible by p(x). 

Table [I] shows an example using the set-up shown in Fig. [3] 
with p(x) = x 2 + x + 1 , q'(x) = x 3 + x 2 + 1 , and v = 2 . 
The original encoder c(x) does not need to be specified for 
the results in Table [I] because any feedforward encoder will 
produce the same state sequence. The zero state of the original 
code is visited at time 0, 3, 4, and 8 . At time 0, the state begins 
from S z . At time 3, the first c(x) error event ends. This error 
event is detectable if the codeword ends at time 3 because the 
input to the original code, i.e. x 2 , is not divisible by p(x) . 
The state remains in S D at time 4. At time 5 a second c(x) 
error event begins. At time 8 , the c eq (x) state returns to S z , 
which is not only the end of the second c(x) error event but 
also the end of an undetectable double error. 

Note that c eq (x) is catastrophic because its generator poly¬ 
nomials have a common factor p(x). The catastrophic behavior 
is expressed through the zero distance loops that occur as the 
equivalent encoder traverses a sequence of states in S D while 
the original convolutional encoder stays in the zero state. Thus 
time spent in S D between c(x) error events can lengthen an 
undetectable error without increasing its distance. 

2) Error Events in the Equivalent Encoder: Since the all¬ 
zero codeword is assumed to be sent, the correct path remains 
in S z forever. An error event in the equivalent encoder occurs 
if the trellis path leaves S z and returns S z without any visits to 
S z in between. We will classify these error events according to 
the number of times it enters S D from S N during the deviation. 
If a trellis path enters S D from S N s — 1 times, then it is 
classified as an undetectable s-tuple error. In an undetectable 
s-tuple error, there are exactly s segments of consecutive 
transitions between states in S N , which correspond to s error 
events of the original encoder. These segments are separated 
by segments of consecutive transitions between states in S D . 

Fig. E illustrates an undetectable triple error in a system 
with v = 2 and m = 2. The three error events of the original 
code are separated by visits to S D . The path can leave S D right 
after entering it as shown between the first and the second error 
events; the path can also stay in S D for a while and then leave 
S D from a state different from where it enters S D as shown 
between the second and the third error events. Note that S z is 
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Fig. 4. A trellis diagram of an equivalent convolutional code with v — 2 
and m = 2 having an undetectable triple error. The states are reordered such 
that the all-zero state is at the top and the detectable-zero states are at the 
bottom. 


never directly connected to any state of S D . 

The probability of undetectable single errors is bounded in 
a similar way as 0 by 


ud,i x 


< P 


>< 2,1 






max{0, n + i/ — Z zz + l}P(d), (12) 


d€D ; 


where a zz is the number of error events with distance d 
starting from S z and ending at S z while never traversing a 
state in S D . The length Z zz is the length of the i th error event 
counted in a zz . Note that this expression requires the distance 
spectrum a zz and length spectrum / zz for d £ df lee ,d 
obtained using computer search. 

Unlike 0, ([12} does not need to check the divisibility of 
error events. Hence, there is no need to store the error patterns 
anymore but only their distances and lengths. Consequently the 
search depth d can go beyond d. 

Let Sf be the i th state in S D , where i £ [1,2™ — 1], 
Define A,; as the subset of S D composed of all states con¬ 
nected with Sf through a path that only includes states in 
S D . Appendix [b] shows that A,; and the .x'-cyclotomic coset 
modulo p(x) discussed in Appendix [a] are equivalent. Thus 
if p(x) is a primitive polynomial, all sets A, are identical 
and | A; | = 2 m — 1 is maximized. This may lead to fewer 
undetectable double errors as shown below and is desirable in 
the design of a CRC code. 

Similar to 0, the probability of undetectable double errors 
can be bounded by 


2 m —1 


ZD, <p DZ ,ib 
a, a j 
“1 “2 


UD,2 


< P 


>d, 2 


E E E E 


=1 S° GA 0 *1 = 1 *2=1 


(d 1 ,d 2 )CDj 2 

L(n+i/-r in )/|A^|J 

^ ( n + z/ _rin-|A^|f 1 + l), (13) 


where 


£i=0 


/min _ 7ZD,0 . , r 

1 ~ l 'd 1 ,ii l d 2 ,i 2 


(14) 


is the shortest possible length of the undetectable double error 
specified by (di, d 2 , <j>, 


In ( [13} , 0 is the index of the state at which the trellis enters 
S D at the end of the first error event, and p: is the index of 
the state at which the trellis leaves S D at the beginning of the 
second error event. In ( [14} , < A, ;j is the number of hops 

required to go from to S° without leaving S D for S° € A^, 
where 8 ,p^ = 0 if 4 > — The number of error events starting 
at S z and ending at S° with distance d\ is a™’^, and the i\ h 
error event of them has length , where both numbers are 
obtained by computer search. The variables and 

are defined in a similar way while the error event starts in 
and ends in S z . Furthermore, t\ specifies the number of cycles 
the trellis stays in A, 3 and its upper limit makes sure that the 
total length of the undetectable double error does not exceed 
n + v, the number of trellis stages in the codeword. As in 0 
^><2 2 can °l tcn he neglected because terms with d±+d 2 > d 
have negligible probability. 

Although the number and lengths of error events connecting 
S z and states in S D are obtained by computer searches, the 
required search depth is only d — df lee . Moreover, « ZD ' r ' and 
f z °'f for all <f> can be found while searching for a zz and ( zz 
because these two types of error events both start from S z . 
Regarding and , the associated error events start 

from 2™ — 1 different states and can be found through 2 m — 1 
separated searches. Nevertheless, since these error events end 
at S z , only one backward search is necessary to capture all 
of them. In the backward search, bits in shift registers move 
backward. One can simply treat x as x~ x in polynomial 
representations and apply the same search algorithm. 

3) Undetected Error Probability: As shown in Fig. [4j an 
undetectable s-tuple error is composed of several parts: one 
error event from S z to a state in S D , s — 2 error events from a 
state in S D to a state in S D with visits to S N in between, the 
final error event from a state in S D to S z , and also s — 1 paths 
inside S D connecting consecutive error events. Note that in the 
example of Fig. [4] where s = 3, where the path connecting 
the first and the second error events has a length of zero. Let 
<p u and if) u be the indices of the start and end states of the u th 
transitions in S D , respectively. In Fig. [ 4 ] we have &-[ = M:-\ = 2 
for the first transition; the second transition starts from S^ 2 and 
ends at S° 2 , where 62 = •’> and ' 1 P 2 = 1 ■ Also, let the number of 
error events started at S° u _ 1 and ended at with distance d u 
be an£ | ^ i en gth of the ( u th error event of them 

be where both numbers are obtained by computer 

search. Although we need to perform 2™ —1 separated searches 
to obtain all and a search depth of 

d — (s — 1) dfree is sufficient. 

To simplify the notation, define the following sets: 


ML 

1 ; 


T s 


{((/>!, • • • , <j) a ) \ 4 >u G [1, 2 m - 1] [1, 3] } 


{(V’ti • 

•• ,1ps) 

\i/j u £ A 0u Mu £ [1, s]} 

{(*!>■ 

■ ,is) 

ii £ 

1 a 20 ’^ 1 
a d 1 

, is G 

1 

L a d 3 



iu £ 


1 DD,i/>„_i,0 u 

a d u 


Vti £ [2, s — 1] j 


,t s ) 


t u >0 Mu £ [ 1 , s], 
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E I I t u <n + v -l 


s+l (’ 


where / r s ,lln is the shortest possible length of the undetectable 
s-tuple error specified in a similar way as ( [14} and given by 

S —1 S —1 

/min /ZD,, \ ^ . \ r r /i 4r\ 

^ - L d 1} n + Z^ L d u ,iu +l d a ,i s +Z^ d ^^- (15) 

u=2 n=l 

Similar to 0, the probability of having an undetectable 
error is now bounded by 

„zz 

d l 

Pud < E E max {°^ + "-^ + 1 } P ( rfl ) 

d i eD d,i i=1 

OO / S \ 

+E E p E*. E 

s=2 (d,,... ,d a )£D d - s \«=1 / A_i)6^,_i 

E E E 

(*,-A-i)6^-i (*i,-,*s)el' (ii,--- ,t,-i)£T,_i 

( s— 1 \ OO 

n + v~ lf n - E II ^ + 1 +E P >c*V ( 16 > 

U—l ) S=1 

The last term is the probability sum of all large-distance errors, 
which are assumed to be undetectable, and can be calculated 
using dllc| >. By letting P g g = 0, we obtain an approximation. 
By letting all l zz , l ZD , l DD , and l DZ equal to zz + 1, we obtain a 
looser bound which does not require any length information. 
These two techniques are applicable to every P ( ; Ds , including 
([I2]i and ([13}. 


C. Comparison 

The main benefit of the construction method is that it is 
often able to search deeper than the exclusion method because 
the output pattern is not required. However, the required 
memory size scales with the number of states 2 m+l ' rather 
than 2 " so this approach can encounter difficulty in analyzing 
high-order CRC codes. In contrast, the error events searched 
in the exclusion method belong to the original convolutional 
code, whose number of states is just 2 " and is independent of 
the degree of the CRC code. In fact, both methods create the 
dominant parts (till d or d) of the distance spectrum of the 
equivalent catastrophic convolutional code with finite length. 


As explained in Section IV we found it useful to draw on both 
approaches as we searched for optimal CRC polynomials for a 
specific convolutional code. The exclusion method is suitable 
for short-distance (d < d) undetectable single errors, and the 
construction method is preferred when we search for longer 
undetectable single errors and double errors. 

Fig. 0 compares the simulated undetected error probability 
to the bounds produced by the exclusion and construction 
methods. We consider the CRC code p(x) = x 3 + x + 1 
concatenated with the memory size v = 4 convolutional code 
with generator polynomial (23,35)s hr octal and df ree = 7. 
The information length is k = 1021 bits and thus the CRC 
codeword length is n = 1024 bits. The FER of this original 
convolutional code is plotted as a reference. The equivalent 



Fig. 5. A comparison of the simulated undetected error probability with the 
simulated frame error rate of the equivalent code and the analyses from the 
exclusion and construction methods. The system is equipped with the CRC 
code p(x) = x 3 + x -\- 1 and the convolutional code (23,35)8- The CRC 
codeword length is n = 1024 bits. 


catastrophic convolutional code is (255,317)g and its FER is 
also simulated. 

We can see from Fig. [5]that the undetected error probability 
is upper bounded by the FER of the equivalent code, which 
equals Pud- the probability of having an undetectable error 
in the CRC codeword, and this bound gets tighter as SNR 
increases. The equivalent code FER is above the probability 
of undetected error because it is possible that a frame has 
both an undetectable error event and a detectable error event, 
which causes a frame error in the equivalent code but does 
not cause an undetected error in the concatenated CRC and 
convolutional codes. 

The upper bounds of Pud are computed using the exclusion 
method 0 and construction method ( p~ 6 ] >. In our calculation, 
the search depth limit of the exclusion method is d = 14, 
and d = 20 is the depth limit for the construction method. 
Since c/| rcc = 7, only undetectable single and double errors 
are considered. The probability sum of all large-distance terms 
given by ( |1 lc| > are plotted to verify that they are negligible, 
except for the exclusion method at SNR below 0.75 dB. 
Although the construction method used a deeper search, it 
is still quite close to the exclusion method even in the low 
SNR region. It can be seen that these analysis methods deliver 
accurate bounds at high SNR for both the undetected error 
probability and the FER of the catastrophic code. 

IV. Search Procedure for Optimal CRC Codes 

In this section, we will present an efficient way to find 
the optimal degree-m CRC code for a targeted convolutional 
code and information length k. Note that the performance 
of a CRC code depends on the information length j5). A 
CRC code may be powerful for short sequences but have 
numerous undetectable long errors that are produced by a 
specific convolutional code. 

Since the coefficients of x m and x° terms are both one, 
there are 2 m_1 candidates of degree-m CRC generator poly¬ 
nomials p(x). In principle, either the exclusion method or 










the construction method can produce the undetected error 
probability for each candidate allowing selection of the best 
p(x). However, this process is time-consuming if to is large. 
Both exclusion and construction methods need to compute the 
distance spectrum of undetectable errors up to some distance d. 
We can reduce the computation time by skipping the distance 
spectrum searches of suboptimal CRC codes. 

When the FER is low, the undetectable error rate of a CRC 
code is dominated by the undetectable errors with the smallest 
distance. Let the smallest distance of the undetectable errors be 
d m in- We can evaluate a CRC-convolutional concatenated code 
by its distance spectrum at around d m i n . To be more precise, a 
polynomial should be removed from the candidate list if it has 
a smaller c/ niin than the others or if it has more undetectable 
errors associated to the same r/ lt(ill . In a convolutional code, 
since the number of error events ad grows exponentially as the 
associated distance d increases, the cost to find all undetectable 
errors grows exponentially as well. Hence, the CRC code 
search starts with d = df lee and updates the candidate list by 
keeping only the CRC generator polynomials with the fewest 
undetectable errors. Next, repeat the procedure with the next 
higher d until only one polynomial remains in the list. 

When d < 2df ree , only single errors are possible. The 
exclusion method can count the number of undetectable single 
errors of each candidate when d < d. We can perform a 
computer search for error events of the original code and check 
the divisibility of each of them. Note that if an error event is 
found to be undetectable, all of its possible offsets in the code¬ 
word should be counted. Since all candidates check the same 
set of error events, only one computer search with multiple 
divisibility checks (one for each CRC code) is sufficient. In 
contrast, using the construction method requires construction 
of equivalent encoders for each candidate separately. Hence, 
for the initial values of d near df ree , checking the divisibility 
via the exclusion method is preferred. Of course, once d > d, 
searching for undetectable single errors of the equivalent codes 
as in the construction method is the only approach. 

When d > 2df ree , undetectable double errors need to be 
considered in addition to single errors. The divisibility test 
should be applied to all combinations of error event patterns 
ei(x), ez^x), and their gaps g\. Even if the concept of 
cyclotomic cosets discussed in Appendix [A] is utilized, we 
still need to construct all cyclotomic cosets through about 
2 m divisions and also check if each of the remainder of 
ez(x) divided by p(x) is in the same cyclotomic coset as the 
remainder of e 1 (x) divided by p(x). 

Alternatively, undetectable double errors can be directly 
created using the construction method. In the construction 
method, the error events connecting S z and states in S D 
with distances between df ree and d — df ree are found through 
computer searches. In fact, these events can be generated using 
the detectable error events of the original code previously 
found by exclusion if d — df ree < d. For example, since the 
detectable error pattern e±(x) is known, the corresponding 
error event in the equivalent encoder trellis starts from S z 
and traverses the trellis with the input sequence given by the 
quotient of x m ei{x) divided by p(x). The state S^, where it 
ends, is thus determined by the last to + v input bits, and its 


length / ZD 'i’ has already been provided by the degree of e\ (x). 

For error events starting from states in S D and ending in S z 
with pattern e 2 (x) previously obtained by exclusion, traverse 
the trellis in reverse from S z with the input sequence given by 
the quotient of x m+,y+l e 2 (x~ 1 ) divided by x m p(x~ 1 ), where 
V = — 1 is the degree of e 2 (x). Note that x l e 2 (x~ 1 ) and 

x' n p(x ' ) are the reverse bit-order polynomial representations 
of 62 {x) and p{x), respectively. The state S° where the error 
event begins is determined by the last m + v input bits in 
reverse order. 

According to the discussion at the end of Appendix |A| d m ; n 
is not likely to be much greater than 2df ree when information 
length k is long enough. In other words, the CRC code search 
algorithm is usually finished before reaching 3df ree and does 
not need to count the number of undetectable triple errors. 


V. CRC Design Example for v = 6, k = 1024 


As an example we present the best CRC codes of degree 
to < 16 specifically for the popular memory size v = 0 
convolutional code with generator polynomial (133,171)8 
with information length k = 1024 bits. Note that the proposed 
design method is applicable to all convolutional codes and 
information lengths, and not limited to the choices used for 
this example. The corresponding undetected error probability 
is also calculated and compared with existing CRC codes. 

Table |II1 shows the standard CRC codes listed in 0 and the 
best CRC codes found by the search procedure in Section IV 
For degrees with no standard codes, those recommended by 
Koopman and Chakravarty in [5j are listed and called K&C. 
The notation of generator polynomials is in hexadecimal as 
used in {5j|. For example, CRC-8 has generator polynomial 
X s + x 7 + x 6 + x 4 + x 2 + 1 expressed as OxEA, where the 
most and least significant bits represent the coefficients of x 8 
and x 1 terms, respectively. The coefficient of x° term is always 
one and thus omitted. 

Table [II] also gives the distance spectrum of undetectable 
single errors a zz of each CRC code up to d = 22. The distance 
spectrum of the original convolutional code ad is given as a 
reference. Note that, since this convolutional code has df lee = 
10 , a smaller a| z or aff does not mean fewer undetectable 
errors at distance d = 20 or 22. Undetectable double errors 
should also be counted for d > 20 to judge a candidate. 

During the search for the best CRC codes with degrees 
to < 11 , only single errors need to be considered because one 
candidate will outperform all the others before looking at d = 
20. Although the best degree-11 CRC code has d m in = 20, all 
the other candidates have < 20 and are eliminated before 
the end of the d = 18 round. Since the lengths of single errors 
for d < 20, ranging from 7 to 43, are much shorter than n+ 
v, a candidate that has fewer types of dominant undetectable 
error events will have fewer dominant undetectable errors in 
total. In other words, when undetectable single errors dominate 
and information length k is long enough, the best CRC code 
should possess the smallest a zz . 

When d m j n > 2df ree , the dominant undetectable errors 
include double errors. In this case, a smaller a zz does not 
mean a better code because it only considers single errors. 
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TABLE II 

Standard CRC Codes or CRC Codes Recommended by Koopman 
AND CHAKRAVARTY (K&C) (5j, AND THE BEST CRC CODES FOR 
Convolutional Code (133,171)8 with k = 1024 Bits. 



Gen. 

Undetectable Single Distance Spectrum a zz 

Name 

Poly. 

d 

10 

12 

14 

16 

18 

20 

22 

K&C-3 

0x5 

1 

5 

19 

170 

941 

5050 

29290 

Best-3 

0x7 

0 

7 

24 

169 

879 

5111 

29363 

CRC-4 

OxF 

1 

2 

11 

79 

464 

2504 

14719 

Best-4 

OxD 

0 

1 

17 

91 

462 

2537 

14674 

CRC-5 

0x15 

1 

2 

9 

52 

267 

1378 

8005 

Best-5 

0x11 

0 

0 

4 

52 

230 

1257 

7275 

CRC-6 

0x21 

0 

1 

4 

21 

124 

572 

3659 

Best-6 

0x29 

0 

0 

1 

22 

124 

641 

3650 

CRC-7 

0x48 

0 

0 

1 

14 

55 

298 

1877 

Best-7 

0x47 

0 

0 

0 

7 

70 

322 

1867 

CRC-8 

OxEA 

0 

0 

0 

4 

36 

174 

871 

Best-8 

0x89 

0 

0 

0 

1 

29 

177 

938 

K&C-9 

0x167 

0 

0 

0 

4 

13 

73 

477 

Best-9 

0x177 

0 

0 

0 

0 

14 

104 

437 

CRC-10 

0x319 

0 

0 

0 

1 

8 

41 

239 

Best-10 

0x314 

0 

0 

0 

0 

3 

49 

223 

CRC-11 

0x5C2 

0 

0 

0 

0 

7 

17 

107 

Best-11 

0x507 

0 

0 

0 

0 

0 

24 

113 

CRC-12 

0xC07 

0 

0 

0 

0 

3 

12 

48 

Best-12 

OxAlO 

0 

0 

0 

0 

0 

4 

66 

K&C-13 

Ox102A 

0 

0 

0 

0 

1 

7 

36 

Best-13 

0x1EOF 

0 

0 

0 

0 

0 

1 

29 

K&C-14 

0x21E8 

0 

0 

0 

0 

1 

2 

15 

Best-14 

0x314E 

0 

0 

0 

0 

0 

0 

11 

K&C-15 

0x4976 

0 

0 

0 

0 

1 

1 

6 

Best-15 

0x604C 

0 

0 

0 

0 

0 

0 

3 

CRC-16 

OxAOOl 

0 

0 

0 

0 

0 

1 

3 

Best-16 

Ox8E61 

0 

0 

0 

0 

0 

0 

1 

Original Distance Spectrum 

11 

38 

193 

1331 

7275 

40406 

234969 


For example, the degree-16 polynomials 0xF8Fl and 0x8E61 
both have d ui \ n = 22. The former has off = 0 while the latter 
has off = 1. However, at d = 22, the former has so many 
(2860) undetectable double errors that the number is greater 
than the total count of undetectable single and double errors 
(1011 + 1424) of the latter, when the information length is 
k = 1024 bits. However, when k = 512 bits, the former has 
fewer undetectable errors and becomes optimal. 

Therefore, different information lengths may lead to differ¬ 
ent optimal CRC designs. The authors of |5] have included the 
information length k as a key design parameter and proposed 
a methodology to determine “good” CRC codes for a range 
of k. The same rule can be applied here. First, find the 
CRC polynomials possessing the largest d u ,\ u for the longest 
k. Then, consider shorter k and keep the CRC polynomials 
having the largest d m i n , which might increase as k decreases. 
Table [ill] shows the best CRC codes for k = 256, 512, or 
1024 bits and identifies the “good” codes for this range of 
information lengths. The bold numbers indicate the k for 
which the code is designed. For example, the code OxAlO 
is the best degree-12 code at lengths k = 256 and 1024 bits. 
Note that the codes with degrees m < 11 are not shown since 
undetectable single errors dominate and thus the best CRC 
codes for these k are identical. In fact, the best CRC code 
for the largest k is usually “good” for shorter k. In our case. 


TABLE in 

The Best CRC Codes for Convolutional Code (133,171)s with 
k = 256, 512, or 1024 Bits. Bold Numbers Indicate the k for 
which the Code is Designed. The “Good” Codes for This Range 
of k are Specified. 


Degree 

Gen. (dmin? count of undetectable errors at d m i n ) 

Poly. k 

256 bits 512 bits 1024 bits 

12 

OxAlO (good) 
0x8DC (good) 

(20,1664) (20. 5525) (20,17732) 
(20,1904) (20, 4748) (20,19283) 

13 

0xl8F6 (good) 
0x1 EOF (good) 

(20, 169) (20, 1474) (20, 7452) 
(20, 289) (20, 1187) (20, 5301) 

14 

0x2E20 

0x314E (good) 

(22,3196) (20. 520) (20, 2056) 

(22,4698) (22,12324) (20, 198) 

15 

0x6D80 

0x76AD 

0x604C (good) 

(22, 962) (20, 253) (20, 765) 

(22,1210) (22, 2808) (20. 1382) 
(22,1767) (22. 4414) (22,13329) 

16 

0xA219 

0xF8Fl (good) 
0x8E61 

(24,7396) (22, 316) (20, 454) 

(24,9823) (22, 219) (22, 2860) 

(22, 243) (22, 629) (22, 2435) 



Fig. 6. The undetected error probability of the existing and best CRC codes 
for convolutional code (133,171)s with information length k = 1024 bits 
computed using the construction method. 

the best k = 1024 CRC codes have no more than O.ldB loss 
compared to the best k = 256 and the best k = 512 CRC codes 
at information lengths k = 256 and 512 bits, respectively, 
except for the degree-16 codes. 

In Fig. [6] the bounds of undetected error probability of the 
existing and best CRC codes for information length k = 1024 
bits are shown. For clarity, only even degrees of the CRC codes 
are displayed. The upper bound of the original convolutional 
code FER without any CRC code, calculated using transfer 
function techniques 03- is plotted as a reference. In our 
calculation, the search depth limit of the exclusion method is 
d = 22 and not enough for high degree CRC codes. Therefore, 
the construction method ( fl6l > was used with d = 28. 

The probability sum of all large-distance terms calculated 
using ( ] 1 lc| ) is also plotted to illustrate that the large-distance 
terms really are negligible even assuming they are all un¬ 
detectable, except for the best degree-16 CRC code at SNR 
below 7 dB and some other codes at SNR below 6 dB. Note 
that since the operation e p — 1 in ( file) causes non-negligible 
rounding errors in the high SNR region, it was approximated 
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Fig. 7. The undetected error probability of the existing and best CRC codes 
for convolutional code (133,171)8 with information length k = 1024 bits 
at SNR = 8 dB. 


by the first ten terms, which results a similar expression as 
( |1 lb| ) but only carried out for 1 < s < 10. 

Since d < 3df ree , it is not necessary to evaluate triple or 
higher order errors under this truncation. It is clearly seen from 
the figure that the best degree-??! CRC codes found by our 
procedure outperform the existing degree-77! codes for all to. 
Furthermore, their performance is either better than or similar 
to the existing degree-(ro + 2) code except for to = 6. In 
other words, the proposed design can typically save 2 check 
bits while keeping the same error detection capability. 

We can compare the error detection capability of all codes 
at a fixed SNR. If we draw a vertical line at SNR = 8 dB 
on Fig. [6] the intersections can be plotted along with the 
associated CRC lengths m in Fig. [7] The largest reduction of 
undetected error probability is about five orders of magnitude 
at to = 5 where the existing CRC code has an undetected error 
probability of 1.01 x 10 -12 and the newly designed CRC code 
has an undetected error probability of 1.36 x 10 17 , based 
on the analysis. Note that these numbers are calculated at 
8 dB SNR, and the reduction gets more significant as SNR 
increases. 

Since the existing CRC codes are not tailored to the 
convolutional code, a higher degree code does not necessarily 
have a better performance. We observe that there are three 
almost horizontal region for the existing CRCs with degrees 
to ranging from 3 to 5, from 8 to 9, and from 13 to 15. The 
reason is that the codes in each region have similar number 
of dominant undetectable errors as can be seen in Table [TI] In 
contrast, the CRC codes designed using our procedure show 
steady improvement as the degree increases. 

The existing and best CRC codes can also be compared 
in terms of the required CRC length to achieve a certain 
undetected error probability. Assume our target is to reach 
undetected error probability below 10~ 25 . This can be shown 
by drawing a horizontal line at the target probability level on 
Fig. 0 and plotting the CRC lengths associated to the crossed 
points as a function of SNR as in Fig. [8] For most of the 
SNR levels, the best CRC codes requires two fewer check 
bits than the existing CRC codes to achieve the same error 
detection capability. At SNR around 10.5 dB, the best CRC 
code requires three fewer check bits than the existing CRC 
code. Since the existing code required six check bits, this is a 
50% reduction. 



Fig. 8. The required CRC lengths for the existing and best CRC codes for 
convolutional code (133,171)g with information length k = 1024 bits to 
achieve undetected error probability below 10 —25 . 

VI. Conclusion 

A good CRC code in a convolutionally coded system should 
minimize the undetected error probability. To calculate this 
probability, two methods based on distance spectrum are 
proposed. The exclusion method starts with all possible single 
and multiple error patterns of the convolutional code and 
excludes them one by one by testing if they are detectable. 
In the construction method, undetectable errors are mapped to 
error events of an equivalent convolutional code, which is the 
combination of the CRC code and the original convolutional 
code. The computer search for error events in the construction 
method does not need to record the error patterns and thus 
can go deeper than the search in the exclusion method. 
However, the construction method could encounter difficulties 
while dealing with high-degree CRC codes. Moreover, the 
construction method is generally applicable to the performance 
analysis of catastrophic convolutional codes. 

We also propose a search procedure to identify the best CRC 
codes for a specified convolutional encoder and information 
length. A candidate CRC code is excluded if it has more 
low-distance undetectable errors. Therefore, the best CRC 
polynomial is guaranteed to have the fewest dominant unde¬ 
tectable errors and minimizes the probability of undetected 
error when SNR is high enough. When undetectable double 
errors dominate, the choice of the best CRC polynomial is 
more dependent on the information length. In an example 
application of the design procedure for the popular 64-state 
convolutional code with information length k = 1024, new 
CRC codes provided significant reduction in undetected error 
probability compared to the existing CRC codes with the same 
degrees. With the proposed design, we are able to save two 
check bits in most cases while having the same error detection 
capability. 

It is an open problem to generalize this work to other error- 
correcting codes, such as turbo or low-density parity-check 
(LDPC) codes. The construction method can only be applied 
when the encoder has the convolutional structure. We note 
that convolutional LDPC codes have such a structure. For 
turbo codes, the construction method can be used to analyze 
one of the recursive systematic convolutional (RSC) codes but 
analyzing the other RSC code is not straightforward due to 
the existence of the interleaver. Nevertheless, the exclusion 
method can be generalized to any other error-correcting codes 



















11 


if the dominant error events are identified. 


Appendix A 

Efficient Search for Undetectable Double Errors 
In <[4]» and (|5j, the indicator function is evaluated for every 
g i given ei(x) and e 2 (x). However, there is a more efficient 
way to check the divisibility and can save a lot of computation 
time when n + v is large. 

The remainder of any polynomial in GF(2)[x] divided by 
p(x) forms a quotient ring GF(2 )\x\/p{x). Two polynomials 
mapped to the same element of the quotient ring are called 
congruent. If x 9l+l2 e\(x) is congruent to e 2 (x) modulo p(x), 
then their combination forms an undetectable double error. To 
characterize the remainder of x 9l+l2 ei(x) modulo p(x), we 
apply the concept of cyclotomic coset |18|, which is origi¬ 
nally defined for integers, to polynomials. For polynomials in 
GF(2)[x], define x-cyclotomic coset modulo p(x) containing 
e(x) as 

C e(x) = {x h e{x) (mod p(x))\h = 0,1, • • • } , (17) 


which includes the remainder of all possible offsets of e(x) 
divided by p[x). One can verify that two cyclotomic cosets 
are either identical or disjoint, so all of the distinct x- 
cyclotomic cosets modulo p(x) form a partition of quotient 
ring GF(2)[x]/p(x). For example, consider a degree-2 primi¬ 
tive polynomial p(x) = x 2 +x + l, and we have C 0 = {0} and 
Ci = {l,x,x + l} forming a partition of GF(2)[x]/p(x); 
consider a degree-2 non-primitive polynomial p(x) = x 2 + 1, 
and we have Co = {0}, Ci = {l,x}, and C x +i = {x + 1} 
forming a partition of GF(2)[x]/p(x). In both cases, Co 
is trivial since it only contains the “zero” element. In the 
primitive case, Ci is the only non-trivial cyclotomic coset and 
its cardinality is |Ci| = |GF(2)[x]/p(x)| — 1 = 2 m — 1. In the 
non-primitive case, there are multiple non-trivial cyclotomic 
cosets and their sizes are smaller than 2 m — 1. In fact, there 
is only one unique non-trivial cyclotomic coset if p(x) is a 
primitive polynomial. 

It is obvious that if ei(x) and e2(x) belong to different 
cyclotomic cosets, there is no way to have a g± that makes 
x 9l+l2 ei(x) congment to e2(x) modulo p(x). In other words, 
it is unnecessary to check whether any g\ creates an un¬ 
detectable double error with the specific ei(x) and e2(x). 
If ei(x) and e2(x) belong to the same cyclotomic coset 
C ei(x ), only one proper g x £ [0, |C ei(x) | - l] can make 
x 9 i +i2 e 1 (x)+e 2 (x) divisible by p(x). Denote this particular g\ 
as g[. Once we find g\. all possible g\ that create undetectable 
double errors are just gi = g[ + u |C ei ( x )| for non-negative 
integer u satisfying g\ + l\ + l% < n + v. 

Note that when ei(x) = e2(x), they will belong to the same 
cyclotomic coset no matter what CRC generator polynomial 
p[x) is. That is to say, no CRC code is able to detect such dou¬ 
ble error if these two error events have a proper gap g\. Fortu¬ 
nately, the smallest proper gap is g[ = —12 (mod |C ei ( x ) |) so 
the total length of the undetectable double error is | C ei ( x ) \+h- 
Hence, when n + v is small enough, such double error will 
never occur. On the other hand, when n + v is large, d tlliri , 
which is the shortest distance of the undetectable errors, will 
be upper bounded by 2df ree . 


Appendix B 

The Relationship between A * and C e ( x ) 

Define the state of the equivalent code at time n—g as q' g {x), 
which is a polynomial with maximum degree m + v — 1 repre¬ 
senting consecutive m+v bits from the x 9+rn+v ~ 1 term to the 
x 9 term in x m q'(x ) for g £ [— v, n]. The corresponding state 
of the original code is given by the coefficients of the terms 
from x m+,2 ~ 1 to x m in polynomial q' g {x)p[x). Fet q' g u and p u 
be the coefficients of x“ in q' g (x) and p(x), respectively. Then 
the coefficient of x u for u £ [m,m + v — 1] in q' g (x)p(x) is 
given by 

m 

^ ^ Qq,u—v Pv ' (18) 

v—0 

Assume that the all-zero codeword is sent, i.e. q(x) = 0, and 
the trellis path enters a detectable-zero state Sf at time n — g. 
Also, let e(x) be a polynomial of degree smaller than or equal 
to n — g— 1 representing the length-(n g) input sequence of 
the original convolutional encoder from the beginning to time 
n — g, and it is given by the coefficients from the x n ~ 1 term 
to the x 9 term in q'(x)p(x). Since the state at time n — g is 
S°, e(x) must be non-divisible by p{x). 

The remainder of e(x) divided by p(x) is given by 

m —1 m 

e(x) (mod p(x)) = a: “ J2 %,m+u-vPv, (19) 

u—0 v=u-\-1 

which is totally governed by q' g (x), or Sf. If the remainder 
is known, the bits q g v for v £ [0, m — 1] can be solved 
uniquely through back substitution for u = to— 1, to—2 , • • • , 0 
because p m = 1. Furthermore, the whole polynomial <40*0, 
or S(\ can be solved by letting equal to zero for 

u = m + v — 1 ,to + h— 2, ••• , to because the state of the 
original code is just v zeros. Hence, the remainder of e(x) 
divided by p{x) determines a detectable-zero state S'\ and 
vice versa. Furthermore, each of the 2 m — 1 non-zero elements 
in GF(2)[x]/p(x) corresponds to a unique state in S D . 

To find all states in A,, we can specify q' g _ h 0 , the input bit 
to the equivalent encoder at time n — g + h, for h = 1, 2, • • • 
such that the input bits to the original convolutional encoder 
after time n—g are all zeros and thus the following states are in 
S D . By doing so, we know that the polynomials will 

represent the states in A*. This procedure is finished when 
certain *g-h( x ) represents Sf again. During this procedure, 
the corresponding input sequence to the original encoder from 
the beginning to time n — g + h is simply x h e(x) because the 
input bits after time n — g are all zeros. By the definition given 
in the remainder of x h e(x ) divided by p[x) is an element 

of C e ( x ). In addition, we know that this remainder corresponds 
to a state in A,;. Therefore, A. ( and C contain the same 
elements but just represented in different forms. 
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